[...] processes with Lipschitz and/or dissipative coefficients and apply them to some singular stochastic differential 
equations of interest. 



Keywords: Entropy, (multi-valued) stochastic differential equations, dissipative functions, transport 
inequality, Wasserstein distance. 



1. Introduction 

Let $(W,d)$ be a separable Fréchet space. For two probability measures $P$ and $Q$ on $(W,\mathcal{B}(W))$, the Wasserstein distance (cf. [17]) between $P$ and $Q$, denoted by $d_W(P,Q)$, is defined as

\[ d_W^2(P,Q) = \inf\left\{ \int_{W\times W} d(x,y)^2\,\theta(dx,dy) \,:\, \theta\in\Sigma(P,Q) \right\}, \]

where $\Sigma(P,Q)$ denotes the set of probability measures on $W\times W$ whose first marginal is $P$ and whose second marginal is $Q$. Note that this is a compact set under the weak topology, hence the infimum is always attained for any $d$ (even lower semi-continuous). It is quite useful to find an upper bound for this distance, if possible a dimension-independent one. There are many works on this subject (cf. [17]), beginning with the contributions of M. Talagrand, cf. [10], where it is shown that the relative entropy is a fully satisfactory upper bound. In [5], it is shown that the relative entropy is again an upper bound when $P$ is the Wiener measure and $d$ is the singular Cameron-Martin distance, using the Girsanov theorem (cf. also [1]). The same method has also been employed in [TB] and more recently in [H] to obtain a transportation cost inequality with respect to the Banach norm for diffusion processes. The former assumes quite strong conditions on the coefficients governing the diffusion, which are superfluous and make the inequality difficult to apply, while the latter treats essentially the one-dimensional case, with an extension to the case where the diffusion coefficients are independent and to their slight perturbations. Inspired by these works, we have attacked the general case: namely, the case of fully dependent diffusion-like processes and their extensions, and infinite dimensional diffusion processes driven by a cylindrical Brownian motion. Besides, there is a special class of diffusion processes with singular (dissipative) drifts which are constructed as weak limits of the Lipschitzian case, where the approximating diffusions have Lipschitz continuous drifts but the Lipschitz constant explodes in the limit; this last class is particularly interesting because of its applications to physics. 
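As a sanity check of the kind of bound discussed above, the following minimal sketch verifies the classical inequality $d_W^2(Q,P)\le 2H(Q|P)$ in the simplest possible setting: $P$ the standard Gaussian on the real line and $Q$ another one-dimensional Gaussian, where both sides have closed forms. The function names and the choice of test measures are assumptions made for illustration only; they do not come from the paper.

```python
import math


def w2_squared_gauss(m1, s1, m2, s2):
    """Squared 2-Wasserstein distance between N(m1, s1^2) and N(m2, s2^2) on the real line."""
    return (m1 - m2) ** 2 + (s1 - s2) ** 2


def relative_entropy_vs_standard(m, s):
    """Relative entropy H(Q|P) for Q = N(m, s^2) and P = N(0, 1)."""
    return 0.5 * (s * s + m * m - 1.0 - 2.0 * math.log(s))


def talagrand_holds(m, s):
    """Check d_W^2(Q, P) <= 2 H(Q|P), up to a small floating-point tolerance."""
    lhs = w2_squared_gauss(m, s, 0.0, 1.0)
    rhs = 2.0 * relative_entropy_vs_standard(m, s)
    return lhs <= rhs + 1e-12


if __name__ == "__main__":
    for m, s in [(0.0, 0.5), (1.5, 1.0), (-2.0, 3.0)]:
        print(m, s, talagrand_holds(m, s))
```

In this Gaussian case the inequality reduces to $\log s \le s-1$, and a pure translation ($s=1$) gives equality.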
To achieve this program, we need the following result about the stability of the transportation cost 
inequality under weak limits of probability measures, which is proved by Djellout, Guillin and 
Wu in [4]. Since we make important use of it, we give it with a (slightly different and more 
general) proof. 

Lemma 1. Assume that $(P_k, k\ge 1)$ is a sequence of probability measures on a separable Fréchet space $(W,d)$, converging weakly to a probability $P$. If

\[ d_W^2(Q,P_k) \le c_k \int_W \frac{dQ}{dP_k}\log\frac{dQ}{dP_k}\,dP_k = c_k H(Q|P_k) \]

for any $k\ge 1$ and for any probability $Q$, where the $c_k>0$ are bounded constants, then the transportation inequality holds for $P$, namely

\[ d_W^2(Q,P) \le c\,H(Q|P), \tag{1.1} \]

where $c = \sup_k c_k$. 

Proof: If $f = dQ/dP$ is a bounded, continuous function, then the inequality (1.1) follows from the lower semi-continuity of the transportation cost with respect to weak convergence and from the hypothesis, since $f\log f$ is continuous and bounded. Due to the dominated convergence theorem, to prove the general case, it suffices to prove the case where $f$ is $P$-essentially bounded and measurable. In this case, there exists a sequence of bounded, upper semi-continuous functions, say $(f_n, n\ge 1)$, increasing to $f$ $P$-almost surely. By the dominated convergence theorem, the measures $(\tilde f_n\,dP, n\ge 1)$ converge weakly to the measure $f\,dP$, where $\tilde f_n = f_n/P(f_n)$ and $P(f_n)$ denotes the integral of $f_n$ with respect to $P$. On the other hand $H(\tilde f_n\,dP|P)\to H(f\,dP|P)$, again by the dominated convergence theorem. Hence, to prove the general case, it is sufficient to prove the inequality with $f$ upper semi-continuous and bounded. Since we are on a Fréchet space, there exists a sequence of (positive) continuous functions decreasing to $f$, which may be chosen uniformly bounded by taking the minimum of each with the upper bound of $f$, and the inequality (1.1) follows again from the dominated convergence theorem. □ 
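The first step of the proof, for a bounded continuous density, can be spelled out as the chain below. This is a sketch under stated assumptions: the normalising constants $a_k$ and the auxiliary measures $Q_k$ are notation introduced here for illustration and do not appear in the text.

```latex
% Assumptions: f = dQ/dP is bounded, continuous and nonnegative;
% a_k := \int_W f\,dP_k (so a_k \to \int_W f\,dP = 1) and Q_k := a_k^{-1} f\,dP_k.
% Then Q_k \to Q and P_k \to P weakly, and f\log f is bounded and continuous.
\begin{align*}
d_W^2(Q,P)
  &\le \liminf_{k\to\infty} d_W^2(Q_k,P_k)
     && \text{(lower semi-continuity of the transport cost)} \\
  &\le \liminf_{k\to\infty} c_k\, H(Q_k|P_k)
     && \text{(hypothesis of Lemma 1)} \\
  &\le c \lim_{k\to\infty}\left( \frac{1}{a_k}\int_W f\log f\,dP_k - \log a_k \right)
     && \text{(definition of } H(Q_k|P_k)\text{)} \\
  &= c\int_W f\log f\,dP = c\,H(Q|P).
\end{align*}
```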



2. Diffusion type processes with Lipschitz coefficients 

Let $(W,H,\mu)$ be the classical Wiener space, i.e., $W = C([0,1],\mathbb{R}^d)$, $H = H^1([0,1],\mathbb{R}^d)$ and $\mu$ is the Wiener measure under which the evaluation map at $t\in[0,1]$ is a Brownian motion. Suppose that $X = (X_t, t\in[0,1])$ is the solution of the following SDE (stochastic differential equation)

\[ dX_t = \sigma(t,X_t)\,dW_t + b(t,X)\,dt, \qquad X_0 = z\in\mathbb{R}^d, \]

where $\sigma : [0,1]\times\mathbb{R}^d \to \mathbb{R}^d\otimes\mathbb{R}^d$ is uniformly Lipschitz with respect to $x$ with Lipschitz constant $K$, and $b : [0,1]\times W\to\mathbb{R}^d$ is adapted and such that

\[ |b(t,\xi) - b(t,\eta)| \le K\sup_{s\le t}|\xi(s)-\eta(s)| = K\|\xi-\eta\|_t \]

for any $\xi,\eta\in W$. We denote by $d_W$ the Wasserstein distance on the probability measures on $W$ defined by the uniform norm:

\[ d_W^2(\rho,\nu) = \inf\left\{ \int_{W\times W}\|x-y\|^2\,d\gamma(x,y) \,:\, \gamma\in\Sigma(\rho,\nu) \right\}, \]

where $\Sigma(\rho,\nu)$ is the set of probabilities on $W\times W$ whose first marginal is $\rho$ and whose second marginal is $\nu$. We have the following bound for $d_W$. 

Theorem 1. Let $P$ be the law of the solution of the SDE described above. Then for any probability $Q$ on $(W,\mathcal{B}(W))$, we have

\[ d_W^2(P,Q) \le 6e^{15K^2} H(Q|P), \tag{2.2} \]

where $H(Q|P)$ is the relative entropy of $Q$ with respect to $P$. 

Proof: Due to the rotation invariance of the Wiener measure, we can suppose without loss of generality that $\sigma$ takes its values in the set of positive matrices. Suppose first that $\sigma$ is strictly elliptic. From the general results about SDEs (cf. [3], [5]), the coordinate process $x$ under the probability $P$ can be written as

\[ dx_t = \sigma(t,x_t)\,d\beta_t + b(t,x)\,dt \]

with $x_0 = z$ $P$-a.s., where $\beta$ is an $\mathbb{R}^d$-valued $P$-Brownian motion. At this point of the proof we need the following result, which is probably well known (cf. [5] and the references there), though we include its proof for the sake of completeness: 

Lemma 2. Any bounded $P$-martingale can be written as a stochastic integral with respect to $\beta$ of an adapted process $(\alpha_s, s\in[0,1])$, with $E_P\int_0^1|\alpha_s|^2ds < \infty$. 

Proof: Let us denote by $P^\circ$ the law of the solution of 

\[ dX_t = \sigma(t,X_t)\,dW_t; \]

then under $P^\circ$, the coordinate process $x$ can be written as 

\[ dx_t = \sigma(t,x_t)\,d\beta_t, \]

where $\beta$ is a $P^\circ$-Brownian motion. Let $Z$ be a bounded $P$-martingale with $Z_0 = 0$, and assume that it is orthogonal to the Hilbert space of $P$-square integrable martingales written as stochastic integrals with respect to $\beta$ of adapted processes. Let $M$ be the exponential martingale defined as 

\[ M_t = \exp\left( [\ldots] \right). \]

Then we know from the uniqueness and the Girsanov theorem that $M\,dP = dP^\circ$. Since $M$ can be written as a stochastic integral with respect to $\beta$, our hypothesis implies that $ZM$ is again a $P$-martingale, hence $Z$ is a $P^\circ$-martingale; therefore, from the classical Markov case, it can be written as 

\[ [\ldots] \]

This last expression implies that 

\[ [\ldots] \]

but $Z$ is orthogonal to the stochastic integrals of the form $\int\alpha_s\cdot d\beta_s$, hence $Z_t = E_P[Z_t] = 0$, which proves the claim. □ 
Let us now complete the proof of the theorem. If $Q$ is singular with respect to $P$, then there is nothing to prove due to the definition of the entropy. Let $L$ be the Radon-Nikodym derivative $dQ/dP$; we shall first suppose that $L > 0$ $P$-a.s. In this case we can write 

\[ L = \rho(-\delta v), \]

where $v(t,x) = \int_0^t v_s(x)\,ds$, $v_s(x)$ is a.s. adapted with $\int_0^1|v_s(x)|^2ds < \infty$ a.s., and $\delta v = \int_0^1 v_s\,d\beta_s$. From the Girsanov theorem, $z_t = \beta_t + \int_0^t v_s\,ds$ is a $Q$-Brownian motion. Hence, by the uniqueness of the solution of the SDE, if we denote by $x^v$ the solution of the SDE given as 

\[ dx^v_t = \sigma(t,x^v_t)\,dz_t + b_t(x^v)\,dt, \]

the image of $Q$ under the solution map $x^v$ is equal to $P$; consequently $(x^v\times I_W)(Q)\in\Sigma(P,Q)$, and we have the following domination: 

\[ d_W^2(P,Q) \le E_Q\left[\|x^v - x\|^2\right], \]

where $\|\cdot\|$ denotes the uniform norm on $W$. Using the Doob and Hölder inequalities, we get 

\[ E_Q\Big[\sup_{r\le t}|x^v_r - x_r|^2\Big] \le (12+3t)K^2\,E_Q\int_0^t|x^v_s - x_s|^2ds + 3t\,E_Q\int_0^t|v_s|^2ds. \]

It follows from the Gronwall lemma that 

\[ E_Q\Big[\sup_{r\le t}|x^v_r - x_r|^2\Big] \le 3t\,E_Q\int_0^t|v_s|^2ds\; e^{3K^2(4+t)}. \]

Since 

\[ E_Q\int_0^1|v_s|^2ds = 2H(Q|P), \]

the claim follows in the case $P\sim Q$. For the case where $Q\ll P$, let 

\[ L_\varepsilon = \frac{L+\varepsilon}{1+\varepsilon}; \]

then it is easy to see that $(L_\varepsilon\log L_\varepsilon, \varepsilon\le\varepsilon_0)$ is $P$-uniformly integrable provided $E_P[L\log L] < \infty$. Hence the proof, in the strictly elliptic case, follows by the lower semi-continuity of $Q\to d_W(P,Q)$. The general case follows by replacing $\sigma$ by $\varepsilon I_{\mathbb{R}^d} + \sigma$, then remarking that the corresponding probabilities $(P_\varepsilon, \varepsilon\le\varepsilon_0)$ converge weakly and that 

\[ d_W^2(P_\varepsilon,Q) \le 6e^{15(\varepsilon+K)^2} H(Q|P_\varepsilon), \]

and hence it follows from Lemma 1 that 

\[ d_W^2(P,Q) \le 6e^{15K^2} H(Q|P). \]

□ 
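The coupling used in the proof can be illustrated numerically in a deliberately simple special case. The sketch below assumes $\sigma\equiv 1$, $b(t,x) = -Kx_t$ and a constant shift $v$; none of these choices come from the paper. Driving $x^v$ and $x$ with the same increments $dz$ cancels the noise in the gap $\Delta = x^v - x$, which then solves $d\Delta_t = (-K\Delta_t + v)\,dt$, and for a constant shift $H(Q|P) = \tfrac12 E_Q\int_0^1|v_s|^2ds = v^2/2$.

```python
import math


def coupled_gap_sup_sq(v, K=1.0, n_steps=1000):
    """Euler scheme on [0, 1] for the gap Delta = x^v - x in the assumed special case
    sigma = 1, b(x) = -K x, constant shift v. Both paths share the same noise, so
    Delta solves d Delta = (-K Delta + v) dt with Delta_0 = 0.
    Returns sup_t |Delta_t|^2 over the grid."""
    dt = 1.0 / n_steps
    delta, sup_sq = 0.0, 0.0
    for _ in range(n_steps):
        delta += (-K * delta + v) * dt
        sup_sq = max(sup_sq, delta * delta)
    return sup_sq


def entropy_constant_shift(v):
    """H(Q|P) = (1/2) * int_0^1 |v|^2 ds for a constant (deterministic) shift v."""
    return 0.5 * v * v


def theorem1_bound(v, K=1.0):
    """Right-hand side 6 * exp(15 K^2) * H(Q|P) of inequality (2.2)."""
    return 6.0 * math.exp(15.0 * K * K) * entropy_constant_shift(v)


if __name__ == "__main__":
    v, K = 1.0, 1.0
    print(coupled_gap_sup_sq(v, K), theorem1_bound(v, K))
```

In this toy case the coupling cost is far below the general constant, which is expected: the bound of Theorem 1 has to cover every $K$-Lipschitz pair of coefficients at once.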



Since the inequality (2.2) is dimension independent, we can extend it easily to the infinite dimensional 
case: 
Corollary 1. Let $M$ be a separable Hilbert space and suppose that $B$ is an $M$-cylindrical Wiener process. Assume that $\sigma : [0,1]\times M\to L_2(M,M) = M\otimes_2 M$ (the space of Hilbert-Schmidt operators on $M$) and $b : [0,1]\times M\to M$ are uniformly Lipschitz with Lipschitz constant $K$. Let $P$ be the law of the solution of the following SDE: 

\[ dX_t = \sigma(t,X_t)\,dB_t + b(t,X_t)\,dt, \qquad X_0 = x\in M. \]

Then $P$ satisfies the transportation cost inequality (2.2). 

Proof: Let $(\pi_n, n\ge 1)$ be a sequence of orthogonal projections of $M$ increasing to the identity, and define $\sigma_n = \pi_n\sigma\circ\pi_n$, $b_n = \pi_n b\circ\pi_n$, $B^n = \pi_n B$ and $x_n = \pi_n x$. Let then $P_n$ be the law of the solution of the SDE 

\[ dX^n_t = \sigma_n(t,X^n_t)\,dB^n_t + b_n(t,X^n_t)\,dt, \qquad X^n_0 = x_n. \]

From Theorem 1, $P_n$ satisfies the inequality (2.2) with a constant independent of $n$. Since $(P_n, n\ge 1)$ converges weakly to $P$, the proof follows from Lemma 1. □ 

3. Transport inequality for the monotone case 

Assume that the Lipschitz property of the adapted drift coefficient is replaced by the following dissipativity hypothesis: 

\[ (b(t,x) - b(t,y),\, x_t - y_t) \le 0 \]

for any $t\in[0,1]$ and $x,y\in W$, where, as before, $(\cdot,\cdot)$ denotes the scalar product in $\mathbb{R}^d$. The derivative of a proper concave function on $\mathbb{R}^d$ is a typical example of such a drift. We shall suppose first that 

\[ \int_0^1|b(s,x)|^2ds < \infty \]

for any $x\in W$. 
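To make the dissipativity hypothesis concrete, here is a minimal numerical check for a state-dependent drift that is the gradient of a concave function. The choice $F(x) = -|x|^4/4$, hence $b(x) = -|x|^2x$, is an illustrative assumption and not an example taken from the paper; it is dissipative without being globally Lipschitz.

```python
import random


def drift(x):
    """b(x) = grad F(x) for the concave function F(x) = -|x|^4 / 4 (illustrative choice)."""
    r2 = sum(c * c for c in x)
    return [-r2 * c for c in x]


def dissipativity_gap(x, y):
    """Return the scalar product (b(x) - b(y), x - y); dissipativity means it is <= 0."""
    bx, by = drift(x), drift(y)
    return sum((p - q) * (u - w) for p, q, u, w in zip(bx, by, x, y))


def max_gap(dim=3, trials=1000, seed=0):
    """Largest value of the scalar product over random pairs of points in [-2, 2]^dim."""
    rng = random.Random(seed)
    worst = -float("inf")
    for _ in range(trials):
        x = [rng.uniform(-2.0, 2.0) for _ in range(dim)]
        y = [rng.uniform(-2.0, 2.0) for _ in range(dim)]
        worst = max(worst, dissipativity_gap(x, y))
    return worst


if __name__ == "__main__":
    print(max_gap())  # expected to be non-positive
```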

Proposition 1. Assume that $b$ is of linear growth, i.e., $|b(t,x)| \le N(1+\|x\|)$, and let $P$ be the law of the solution of the following SDE 

\[ dX_t = \sigma(t,X_t)\,dW_t + b(t,X)\,dt + m(t,X_t)\,dt \tag{3.3} \]

with $X_0 = x\in\mathbb{R}^d$, where $\sigma$ and $m : [0,1]\times\mathbb{R}^d\to\mathbb{R}^d$ are uniformly $K$-Lipschitz with respect to the space variable. Then for any $Q\ll P$, we have 

\[ d_W^2(P,Q) \le \left( c\,2^{3/2}\|\sigma\|_\infty^{3/2}\, e^{[\ldots](K^2+2K+1)} + 2\|\sigma\|_\infty\, e^{[\ldots](K^2+2K+1)}\,(1+K(K+2))\, e^{[\ldots](K^2+2K+1)} \right) H(Q|P), \tag{3.4} \]

where $\|\sigma\|_\infty$ is a uniform bound for $\sigma$, $K$ is the Lipschitz constant and $c$ is the universal constant of Davis' inequality for $p = 1$. 

Proof: Recall that under $P$, the coordinate process satisfies $dx_t = \sigma(t,x_t)\,d\beta_t + (b(t,x) + m(t,x_t))\,dt$, where $\beta$ is a $P$-Brownian motion. Assume that $Q$ is another probability on $W$ such that $Q\ll P$, and let $L$ be $dQ/dP$. Suppose first that $L > 0$ $P$-almost surely. As explained in the first section, we can write $L$ as an exponential martingale $L = \rho(-\delta v)$; then $x^v(Q) = P$, where $x^v$ is defined as before: $dx^v_t = \sigma(t,x^v_t)(d\beta_t + v_t\,dt) + b(t,x^v)\,dt + m(t,x^v_t)\,dt$. Again by the uniqueness of the solutions, we have $(x^v\times I_W)(Q)\in\Sigma(P,Q)$, hence 

\[ d_W^2(P,Q) \le E_Q\left[\|x^v - x\|^2\right]. \]
It follows from the Itô formula, letting $dz = d\beta + v\,dt$, that 

\begin{align*}
|x^v_t - x_t|^2 &= 2\int_0^t (x^v_s - x_s,\, dx^v_s - dx_s) + \int_0^t |\sigma(s,x^v_s) - \sigma(s,x_s)|^2ds \\
&= 2\int_0^t (x^v_s - x_s,\, b(s,x^v) - b(s,x))\,ds \\
&\quad + 2\int_0^t \big(x^v_s - x_s,\, (\sigma(s,x^v_s) - \sigma(s,x_s))\,dz_s + (m(s,x^v_s) - m(s,x_s))\,ds\big) \\
&\quad + \int_0^t |\sigma(s,x^v_s) - \sigma(s,x_s)|^2ds - 2\int_0^t (x^v_s - x_s,\, \sigma(s,x^v_s)v_s)\,ds.
\end{align*}

By the dissipative character of $b$, we get 

\begin{align*}
|x^v_t - x_t|^2 &\le 2\int_0^t \big(x^v_s - x_s,\, (\sigma(s,x^v_s) - \sigma(s,x_s))\,dz_s + (m(s,x^v_s) - m(s,x_s))\,ds\big) \\
&\quad + \int_0^t |\sigma(s,x^v_s) - \sigma(s,x_s)|^2ds - 2\int_0^t (x^v_s - x_s,\, \sigma(s,x^v_s)v_s)\,ds.
\end{align*}

Using the usual stopping techniques, we can suppose that the stochastic integral has zero expectation; taking the $Q$-expectation of both sides, we obtain 

\[ E_Q\left[|x^v_t - x_t|^2\right] \le (2K + K^2)\,E_Q\int_0^t|x^v_s - x_s|^2ds + 2\|\sigma\|_\infty\,E_Q\int_0^t|x^v_s - x_s|\,|v_s|\,ds. \]

Using the inequality $xy \le \delta(x^2/2) + (y^2/2\delta)$, we get 

\[ E_Q\left[|x^v_t - x_t|^2\right] \le (2K + K^2 + \delta\|\sigma\|_\infty)\,E_Q\int_0^t|x^v_s - x_s|^2ds + \frac{2\|\sigma\|_\infty}{\delta}H_t(Q|P), \]

where $\delta > 0$ is arbitrary and $H_t(Q|P) = \int\log\frac{dQ}{dP}\big|_{\mathcal{F}_t}\,dQ$ is the entropy for the horizon $[0,t]$, which is an increasing function of $t$. It follows from the Gronwall lemma that 

\[ E_Q\left[|x^v_t - x_t|^2\right] \le \frac{2\|\sigma\|_\infty}{\delta}H_t(Q|P)\exp\left[t(2K + K^2 + \delta\|\sigma\|_\infty)\right]. \tag{3.5} \]

Using now Davis' inequality, the Lipschitz property and the boundedness of $\sigma$, we get 

\[ E_Q\Big[\sup_{r\le t}|x^v_r - x_r|^2\Big] \le \left(2c\|\sigma\|_\infty + \sqrt{[\ldots]}\,H_t(Q|P)^{1/2}\right)\left(E_Q\int_0^t|x^v_s - x_s|^2ds\right)^{1/2} + K(K+2)\,E_Q\int_0^t|x^v_s - x_s|^2ds, \]

where $c$ is the universal constant of Davis' inequality. Note that the right hand side of the inequality (3.5) is monotone increasing in $t$; we insert it into the above inequality and minimize with respect to $\delta$ for $t = 1$, and the proof is completed. □ 
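The passage from Young's inequality to the Gronwall bound (3.5) can be written out as follows. This is a sketch: the shorthand $\varphi$ is introduced here, and the constant $2\|\sigma\|_\infty/\delta$ is what the stated inequality $xy\le\delta(x^2/2)+(y^2/2\delta)$ produces together with $E_Q\int_0^t|v_s|^2ds = 2H_t(Q|P)$; it should be read as a reconstruction rather than a quotation.

```latex
% Assumed notation: \varphi(t) := E_Q[|x^v_t - x_t|^2].
% Gronwall is used in the form: \varphi(t) \le a\int_0^t \varphi(s)\,ds + g(t) with g
% nondecreasing implies \varphi(t) \le g(t)\,e^{at}.
\begin{align*}
2\|\sigma\|_\infty\,|x^v_s - x_s|\,|v_s|
  &\le \delta\|\sigma\|_\infty\,|x^v_s - x_s|^2 + \frac{\|\sigma\|_\infty}{\delta}\,|v_s|^2, \\
\varphi(t)
  &\le (2K + K^2 + \delta\|\sigma\|_\infty)\int_0^t\varphi(s)\,ds
     + \frac{2\|\sigma\|_\infty}{\delta}\,H_t(Q|P), \\
\varphi(t)
  &\le \frac{2\|\sigma\|_\infty}{\delta}\,H_t(Q|P)\,
     \exp\bigl[t\,(2K + K^2 + \delta\|\sigma\|_\infty)\bigr].
\end{align*}
```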



In fact we have another version of the inequality (3.5) in the case where $\sigma$ is not bounded but still $K$-Lipschitz: 

Proposition 2. Assume that all the hypotheses of Proposition 1 are satisfied except the boundedness of $\sigma$, which appears in the SDE (3.3). Then we have the following transportation cost inequality: 

\[ d_W^2(P,Q) \le H(Q|P)\,\frac{2}{(1-\alpha cK)^2}\exp\left(\frac{\frac{cK}{\alpha} + 1 - \alpha cK + 2K + K^2}{1-\alpha cK}\right), \tag{3.6} \]

where $P$ is the law of the solution of the SDE (3.3), $Q$ is any other probability and $\alpha > 0$ is arbitrary provided that $\alpha cK < 1$. 

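Proof: The proof is somewhat similar to the proof of Proposition 1; in fact we control uniformly the stochastic integral term in the Itô development of $|x^v_t - x_t|^2$ as follows: 

\begin{align*}
E_Q\left[\sup_{r\le t}\left|\int_0^r \big(x^v_s - x_s,\, (\sigma(s,x^v_s) - \sigma(s,x_s))\,dz_s\big)\right|\right]
&\le cE_Q\left[\left(\int_0^t |x^v_s - x_s|^2\,|\sigma(s,x^v_s) - \sigma(s,x_s)|^2ds\right)^{1/2}\right] \\
&\le cKE_Q\left[\left(\int_0^t |x^v_s - x_s|^4ds\right)^{1/2}\right] \\
&\le cKE_Q\left[\sup_{s\le t}|x^v_s - x_s|\left(\int_0^t |x^v_s - x_s|^2ds\right)^{1/2}\right] \\
&\le \frac{c\alpha K}{2}E_Q\Big[\sup_{s\le t}|x^v_s - x_s|^2\Big] + \frac{cK}{2\alpha}E_Q\int_0^t |x^v_s - x_s|^2ds.
\end{align*}

Hence we get 

\begin{align*}
E_Q\Big[\sup_{s\le t}|x^v_s - x_s|^2\Big]
&\le \alpha cK\,E_Q\Big[\sup_{s\le t}|x^v_s - x_s|^2\Big] + \frac{cK}{\alpha}E_Q\int_0^t |x^v_s - x_s|^2ds \\
&\quad + (2K + K^2 + \delta)\,E_Q\int_0^t |x^v_s - x_s|^2ds + \frac{1}{\delta}E_Q\int_0^t |v_s|^2ds,
\end{align*}

where $\alpha,\delta > 0$ are arbitrary and $c$ is the constant of Davis' inequality. From above, we obtain 

\[ (1-\alpha cK)\,E_Q\Big[\sup_{s\le t}|x^v_s - x_s|^2\Big] \le \left(\frac{cK}{\alpha} + 2K + K^2 + \delta\right)E_Q\int_0^t |x^v_s - x_s|^2ds + \frac{2}{\delta}H_t(Q|P), \]

and the Gronwall lemma implies that 

\[ E_Q\Big[\sup_{s\le t}|x^v_s - x_s|^2\Big] \le \frac{2}{\delta(1-\alpha cK)}\,H_t(Q|P)\exp\left[\frac{t}{1-\alpha cK}\left(\frac{cK}{\alpha} + \delta + 2K + K^2\right)\right]. \]

Taking $t = 1$ and minimizing the right hand side of the last inequality with respect to $\delta$ completes the proof. 

□ 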
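The final minimization over $\delta$ is elementary and can be checked numerically. Assuming, as a reading of the last display, that at $t=1$ the $\delta$-dependence of the bound is proportional to $g(\delta) = e^{\delta/\theta}/\delta$ with $\theta = 1-\alpha cK\in(0,1]$, setting $g'(\delta)=0$ gives $\delta=\theta$ and $g(\theta) = e/\theta$. The sketch below confirms this on a grid; the names `prefactor`, `argmin_on_grid` and `theta` are introduced here for illustration only.

```python
import math


def prefactor(delta, theta):
    """g(delta) = exp(delta / theta) / delta, where theta stands for 1 - alpha*c*K."""
    return math.exp(delta / theta) / delta


def argmin_on_grid(theta, n=20000, upper=5.0):
    """Brute-force minimiser of g(., theta) on a uniform grid of (0, upper]."""
    grid = [upper * (i + 1) / n for i in range(n)]
    return min(grid, key=lambda d: prefactor(d, theta))


if __name__ == "__main__":
    theta = 0.5
    print(argmin_on_grid(theta), prefactor(theta, theta), math.e / theta)
```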



It is important to notice that we did not use any regularity property of $b$ except the integrability of $t\to b(t,x)$ for almost all $x$ in an intermediate step. This observation means that we can deal with very singular drifts provided that they are dissipative. Let us give an application of Proposition 1 to multi-valued SDEs (cf. [1]) from this point of view. 
Theorem 2. Let $P$ be the law of the process which is the solution of the following multi-valued stochastic differential equation: 

\[ m(X_t)\,dt + \sigma(t,X_t)\,dW_t \in dX_t + A(X_t)\,dt, \qquad X_0 = x\in D(A), \]

where $A$ is a maximal, monotone set-valued function (hence $-A$ is dissipative) such that $\mathrm{Int}(D(A)) \ne \emptyset$. Assume that $\sigma$ and $m$ are uniformly $K$-Lipschitz and that $\sigma$ is bounded. Then $P$ satisfies the transportation cost inequality (3.4). If $\sigma$ is only Lipschitz, but not necessarily bounded, then $P$ satisfies the inequality (3.6). 

Proof: Let $b_n$ be the Yosida approximation of $A$, i.e., $J_n = (I_{\mathbb{R}^d} + \frac{1}{n}A)^{-1}$ and $-b_n = n(I - J_n)$. Then $b_n$ is dissipative and Lipschitz, hence the law of the solution of the SDE 

\[ dX^n_t = \sigma(t,X^n_t)\,dW_t + b_n(X^n_t)\,dt + m(X^n_t)\,dt \]

satisfies the inequality (3.4) with constants independent of $n$. Moreover the law of $(X^n, n\in\mathbb{N})$ converges weakly to $P$ (cf. [1]), hence $P$ also satisfies the inequality (3.4) due to Lemma 1. □ 
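For a concrete picture of the Yosida approximation used above, take the one-dimensional maximal monotone operator $A = \partial|x|$, i.e. $A(x) = \{\mathrm{sign}(x)\}$ for $x\ne 0$ and $A(0) = [-1,1]$. This operator is an illustrative assumption, not an example from the paper. Its resolvent $J_n = (I + \frac1n A)^{-1}$ is soft-thresholding at level $1/n$, and $b_n = -n(I - J_n)$ is a clamped linear function that is dissipative with Lipschitz constant $n$, which blows up as $n\to\infty$.

```python
def resolvent(x, n):
    """J_n = (I + A/n)^{-1} for A = subdifferential of |x|: soft-thresholding at 1/n."""
    t = 1.0 / n
    if x > t:
        return x - t
    if x < -t:
        return x + t
    return 0.0


def b_n(x, n):
    """b_n = -n (I - J_n): minus the Yosida approximation of A, equal to -clamp(n x, -1, 1)."""
    return -n * (x - resolvent(x, n))


if __name__ == "__main__":
    n = 10
    for x in (-2.0, -0.05, 0.0, 0.05, 2.0):
        print(x, b_n(x, n))
```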

As an example of application of this theorem, let us give 

Theorem 3. Let $P$ be the law of the solution of the following SDE: 

\[ dX^i_t = m(X^i_t)\,dt + \sigma(X^i_t)\,dW^i_t + \gamma\sum_{1\le j\ne i\le d}\frac{1}{X^i_t - X^j_t}\,dt, \qquad i = 1,\ldots,d, \]

with $\sigma$ bounded and Lipschitz, $\gamma > 0$. Then $P$ satisfies the transportation cost inequality (3.4), and if $\sigma$ is not bounded but only Lipschitz, then $P$ satisfies the inequality (3.6). 

Proof: It suffices to remark that the drift term following $\gamma$ is the subdifferential of the concave function defined by 

\[ F(x) = \gamma\sum_{i<j}\log(x^j - x^i) \]

if $x^1 < x^2 < \ldots < x^d$, and equal to $-\infty$ otherwise. □ 
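The concavity claim can be checked directly: for ordered points $x$ and $y$, writing $a = x^i - x^j$ and $b = y^i - y^j$ (both negative for $i<j$), the scalar product of the drift difference with $x-y$ is a sum of terms $(1/a - 1/b)(a-b) = -(a-b)^2/(ab) \le 0$. The sketch below tests this numerically on random ordered configurations; the helper names and the sampling scheme are assumptions made for illustration.

```python
import random


def repulsion_drift(x, gamma=1.0):
    """i-th component: gamma * sum over j != i of 1 / (x_i - x_j)."""
    d = len(x)
    return [gamma * sum(1.0 / (x[i] - x[j]) for j in range(d) if j != i)
            for i in range(d)]


def gap(x, y, gamma=1.0):
    """Scalar product (b(x) - b(y), x - y); non-positive when the drift is dissipative."""
    bx, by = repulsion_drift(x, gamma), repulsion_drift(y, gamma)
    return sum((p - q) * (u - w) for p, q, u, w in zip(bx, by, x, y))


def random_ordered_point(rng, d):
    """Strictly increasing vector with consecutive spacings in [0.1, 1.0]."""
    pos, out = rng.uniform(-1.0, 1.0), []
    for _ in range(d):
        pos += rng.uniform(0.1, 1.0)
        out.append(pos)
    return out


def worst_gap(d=4, trials=500, seed=1):
    """Largest scalar product found over random pairs of ordered configurations."""
    rng = random.Random(seed)
    return max(gap(random_ordered_point(rng, d), random_ordered_point(rng, d))
               for _ in range(trials))


if __name__ == "__main__":
    print(worst_gap())  # expected to be non-positive
```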

Remark: For details about the equation of Theorem 3 cf. [5]. Moreover Theorem 2 is applicable 
to all the models given in [3]. 